The reliability of estimated confidence intervals for classification error rates when only a single sample is available
نویسندگان
چکیده
Error estimation accuracy is the salient issue regarding the validity of a classifier model. When samples are small, training-data-based error estimates tend to suffer from inaccuracy and quantification of error estimation accuracy is difficult. Numerous methods have been proposed for estimating confidence intervals for the true error based on the estimated error. This paper surveys proposed methods and quantifies their performance. Monte Carlo methods are used to obtain accurate estimates of the true confidence intervals and compare these to the intervals estimated from samples. We consider different error estimators and several proposed confidence-bound estimators. Both synthetic and real genomic data are employed. Our simulations show the majority of the confidence intervals methods have poor performance because of the difference of shape between true and estimated intervals. According to our results, the best estimation strategy is to use the 10-time 10-fold cross-validation with a confidence interval based on the standard deviation. & 2012 Elsevier Ltd. All rights reserved.
منابع مشابه
Area specific confidence intervals for a small area mean under the Fay-Herriot model
‎Small area estimates have received much attention from both private and public sectors due to the growing demand for effective planning of health services‎, ‎apportioning of government funds and policy and decision making‎. ‎Surveys are generally designed to give representative estimates at national or district level‎, ‎but estimates of variables of interest are oft...
متن کاملSequential-Based Approach for Estimating the Stress-Strength Reliability Parameter for Exponential Distribution
In this paper, two-stage and purely sequential estimation procedures are considered to construct fixed-width confidence intervals for the reliability parameter under the stress-strength model when the stress and strength are independent exponential random variables with different scale parameters. The exact distribution of the stopping rule under the purely sequential procedure is approximated ...
متن کاملبررسی میزان های بروز عوارض ریوی و خطر نسبی عوامل موثر بر آن در مصدومین شیمیایی گاز خردل جنگ تحمیلی عراق علیه ایران
Sulfur mustard is the most widely used chemical agent in the Iran-Iraq war .In a retrospective cohort study of 1337 soldiers with a history of sulfur mustard exposure, factors such as age, smoking habits, number of exposure episodes and the use of gas masks were determined, together with an assessment of their relationship with the occurrence of long-term pulmonary complications. The models sug...
متن کاملA confidence-aware interval-based trust model
It is a common and useful task in a web of trust to evaluate the trust value between two nodes using intermediate nodes. This technique is widely used when the source node has no experience of direct interaction with the target node, or the direct trust is not reliable enough by itself. If trust is used to support decision-making, it is important to have not only an accurate estimate of trust, ...
متن کاملInference on Pr(X > Y ) Based on Record Values From the Power Hazard Rate Distribution
In this article, we consider the problem of estimating the stress-strength reliability $Pr (X > Y)$ based on upper record values when $X$ and $Y$ are two independent but not identically distributed random variables from the power hazard rate distribution with common scale parameter $k$. When the parameter $k$ is known, the maximum likelihood estimator (MLE), the approximate Bayes estimator and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Pattern Recognition
دوره 46 شماره
صفحات -
تاریخ انتشار 2013